A First Investigation on Mongolian Information Retrieval

نویسندگان

  • Guanglai Gao
  • Wei Jin
  • Fei Long
  • Hongxu Hou
چکیده

In this paper we present an attempt to build a test collection for Mongolian IR as well as some preliminary tests about the key issues in Mongolian Information Retrieval: using a stoplist and using word stemming. Our preliminary tests will show that while these basic operations on Mongolian can bring slight improvements in retrieval effectiveness, many problems remain. The results using stemming and stoplist show that the stemming can potentially lead to some gain in retrieval effectiveness; The stoplist slightly improve retrieval effectiveness, but it can reduce the index significantly.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Realization of Mongolian Syntactic Retrieval System Based on Dependency Treebank

In the past seven years, Language Research Institute of Inner Mongolia University has constructed a 500,000word scale Mongolian dependency treebank. The syntactic treebank provides a favorable data platform for language research and information processing. In order to effectively use the treebank, we have designed and implemented a graphical syntactic information retrieval system based on the M...

متن کامل

The Study on Key Technology of Mongolian Full-Text Retrieval

With the development of the Mongolian corpus and website, an increasing number of people have focused their attention on the accurate, complete and fast retrieval of the information that they need. In this paper, such key technological issues in Mongolian full-text retrieval as character shape indexing, drawing the Mongolian verb stem and the automatic recognition of the Mongolian homographic w...

متن کامل

Research on Reasoning and Retrieval Methods Based on Mongolian Curriculum Areas of Semantic Web

The backwardness of the Mongolian network teaching resources results in its low reuse rates and utilization. For this situation, a retrieval method of semantic web based on Mongolian curriculum areas was set up. Firstly, the method established the Mongolian ontology of course ‘Artificial Intelligence ( )’in area of teaching, it uses a relationship database MySQL to record ontology information, ...

متن کامل

A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval

In Modern Mongolian, a content word can be inflected when concatenated with suffixes. Identifying the original forms of content words is crucial for natural language processing and information retrieval. We propose a lemmatization method for Modern Mongolian and apply our method to indexing for information retrieval. We use technical abstracts to show the effectiveness of our method experimenta...

متن کامل

The Semantic Annotation Based on Mongolian Place Recognition

The Mongolian semantic description is the chief problem in construction of Mongolian semantic web. Particularly, more and more Mongolian websites have been created in recent years, and the number of users is sharply increasing, both of them demand for higher quality of Mongolian information retrieval. This paper is based on the study of Mongolian place name recognition and semantic annotation. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008